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Clean Copy of Amended Claims 



1. An automated method for setting up a natural language interface in a 
Web site comprising the steps of: 

defining a hierarchy of topics into which individual documents or 
Web pages can be classified; 

generating a keyword index for those documents; and 
for each topic in the hierarchy, associating a set of n-grams to a 
topic in the topic hierarchy, which set of n-grams is distinctive to that topic 
and wherein the n-grams may be sparse n-grams or non-sparse n-grams. 

2. The automated method for setting up a natural language interface in a 
Web site recited in claim 1, wherein the step of generating a keyword 
index comprises the step of extracting sparse n-grams of keywords for each 
group of pages in the topic hierarchy. 

3. The automated method for setting up a natural language interface in a 
Web site recited in claim 1, further comprising the step of optionally 
reviewing and editing the keyword index. 

4. An automated method for setting up a natural language interface in a 
Web site comprising the steps of: 

automatically inducing a topic hierarchy by examining a structure 
of the Web site; 

creating n-grams fi-om pages in the Web site that are associated 
with a topic in the topic hierarchy and wherein the n-grams may be sparse 
n-grams or non-sparse n-grams; and 

creating rules from the n-grams, wherein each topic has associated 
rules that are used to decide if a new input document or query references 
the topic. 
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1 5. The automated method for setting up a natural language interface in a 

2 Web site recited in claim 4, wherein the step of creating rules is performed 

3 automatically and further comprising the optional step of manually editing 

4 the rules. 

New claim 6 is as follows: 
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6. The automated method for setting up a natural language interface in a 
Web site recited in claim 1 , further comprising the step of converting the 
set of n-grams to classification rules. 



